Benchmarking Scalable Predictive Uncertainty in Text Classification
نویسندگان
چکیده
This paper explores the question of how predictive uncertainty methods perform in practice Natural Language Processing, specifically multi-class and multi-label text classification. We conduct benchmarking experiments with 1-D convolutional neural networks pre-trained transformers on six real-world classification datasets which we empirically investigate why popular scalable estimation strategies ( Monte-Carlo Dropout , xmlns:xlink="http://www.w3.org/1999/xlink">Deep Ensemble ) notable extensions xmlns:xlink="http://www.w3.org/1999/xlink">Heteroscedastic xmlns:xlink="http://www.w3.org/1999/xlink">Concrete underestimate uncertainty. motivate that benefits from combining posterior approximation procedures, linking it to recent research ensembles variational Bayesian navigate loss landscape. find our proposed method combination by analysis in- domain calibration, cross-domain classification, novel class robustness, demonstrates superior performance, even at a smaller ensemble size. Our results corroborate importance fine-tuning dropout rate task hand, individually as an impacts model robustness. observe ablation severely underperform novelty detection, limiting applicability transfer learning when distribution shift classes can be expected.
منابع مشابه
Personalized News Categorization Through Scalable Text Classification
Existing news portals on the WWW aim to provide users with numerous articles that are categorized into specific topics. Such a categorization procedure improves presentation of the information to the end-user. We further improve usability of these systems by presenting the architecture of a personalized news classification system that exploits user’s awareness of a topic in order to classify th...
متن کاملSimple and Scalable Predictive Uncertainty Estimation using Deep Ensembles
Deep neural networks (NNs) are powerful black box predictors that have recently achieved impressive performance on a wide spectrum of tasks. Quantifying predictive uncertainty in NNs is a challenging and yet unsolved problem. Bayesian NNs, which learn a distribution over weights, are currently the state-of-the-art for estimating predictive uncertainty; however these require significant modifica...
متن کاملEnsembles of Sparse Multinomial Classifiers for Scalable Text Classification
Machine learning techniques face new challenges in scalability to large-scale tasks. Many of the existing algorithms are unable to scale to potentially millions of features and structured classes encountered in web-scale datasets such as Wikipedia. The third Large Scale Hierarchical Text Classification evaluation (LSHTC3) evaluated systems for multi-label hierarchical categorization of Wikipedi...
متن کاملScalable text classification as a tool for personalization
We consider scalability issues of the text classification problem where by using (multi)-labeled training documents, we try to build classifiers that assign documents into classes permitting classification in multiple classes. A new class of classification problems; called ‘scalable’, is introduced, with applications on web mining. Scalable classification utilizes newly classified instances in ...
متن کاملBenchmarking short text semantic similarity
Short Text Semantic Similarity measurement is a new and rapidly growing field of research. “Short texts” are typically sentence length but are not required to be grammatically correct. There is great potential for applying these measures in fields such as Information Retrieval, Dialogue Management and Question Answering. A dataset of 65 sentence pairs, with similarity ratings, produced in 2006 ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2022
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2022.3168734